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Abstract 

We prove that the problems of deciding whether a quadratic equation 
over a free group has a solution is NP-complete 

1 Introduction 

The study of quadratic equations over free groups probably started with 
Mal62 and has been deepened extensively ever since. One of the reasons 
research in this topic has been so fruitful is the deep connection between 
quadratic equations and the topology of surfaces. 

In this paper we will show that the problem of deciding if a quadratic 
equation over a free group is NP-complete. This problem was shown to be 
decidable in [CE81| . In addition it was shown in [Ql'89| . |GK92] . and [GL92] 
that if n, the number of variables, is fixed, then deciding if a quadratic 
equation has a solution can be done in time polynomial in the sum of the 
lengths of the coefficients. These results imply that the problem is solvable in 
at most exponential time. We will improve on this by proving (see Theorem 
12. 1[) that deciding if an quadratic equation over a free group has a solution 
is in NP. 

In [DR99 it is shown that deciding if a quadratic word equation has a 
solution is NP-hard. We will prove (see Theorem I3.10[) that deciding if a 
quadratic equations over a free group has a solution is also NP-hard. Our 
proofs are geometric, relying on the topological results of [Ql'89| and disc 
diagram techniques. 



2 Quadratic equations over free groups are 
in NP 

For a finite alphabet alphabet S we denote by S* the free monoid with 
involution with basis S and for u 6 S*, we denote by w -1 its involution. 
We denote by F(S) the free group on S. 
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2.1 Standard form 



A quadratic equation E with variables {xt,yi,Zj} and coefficients {wi,d} £ 
F(A) is said to be in standard form if its coefficients are expressed as freely 
and cyclically reduced words in A* and E has either the form: 

g m — 1 g 

n^ 1 '^] II z l^ w i z i d = 1 or JJ[a;i,j/i] = f (1) 

i=\ j~X i=X 

where [x,y] = x~ 1 y~ 1 xy, in which case we say it is orientable or it has the 
form 

g m — 1 g 

II x i II z^WjZjd = f or ]^[ = f (2) 

z = l j~X i=X 

in which case we say it is non orientable. The genus if a quadratic equation 
is the number g in JTJ and @ and m is the number of coefficients. If g = 
then we will define E to be orientable. If E is a quadratic equation we define 
its reduced euler characteristic, \ as follows: 



X(E) = 



2 — 2g if E is orientable 
2 — g if _E is not orientable 



It is a well known fact that an arbitrary quadratic equation over a free group 
can be brought to a standard form in time polynomial in its length. 



2.2 Ol'shanskii's result 

In sections 2.3 and 2.4 of [Ql'89l it is shown that a quadratic equation E in 
standard form has a solution if and only if for some n < 3(m — x{E)), 

(i) there is a set P — {pi, . . -Pn} of variables and a collection of m discs 
Di, . . . , D m such that, 

(ii) the boundaries of these discs are directed labeled graphs such that each 
edge has a label in P and each pj £ P occurs exactly twice in the union 
of boundaries; 

(iii) if we glue the discs together by edges with the same label, respecting 
the edge orientations, then we will have a collection Eo, . . . , E; of closed 
surfaces and the following inequalites: if E is orientable then each E, 
is orientable and 

(£x(£i))-2/>x(£) 

if E is non-orientable either at least one E; is non-orientable and 
or, each Ei is orientable and 

-2l>x(E) + 2 

and 
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(iv) there is a mapping P — > A* such that upon substitution, the coeffi- 
cients Wi,... ,w m -i and d can be read without cancellations around 
the boundaries of D\, . . . , D m -\ and D m , respectively; and finally that 

(v) if E is orientable the discs D\ , . . . , D m can be oriented so that Wi is read 
clockwise around dDi and d is read clockwise around dD m , moreover 
all these orientations must be compatible with the glueings. 

We note that the bounds in (iii) are not given explicitly in that paper, but 
they follow immediately from the discussion on cutting up diagrams into 
so-called simple diagrams, see [Ql / 89l for details. 



The result of section I2T21 enables us to construct a good certificate. 

Theorem 2.1. For a quadratic equation E in standard form there is a 
certificate of size bounded by 2(|toi| + . . . + |u; m | + \d\ + 3(2g + m)) that can 
be checked in polynomial time. 

Proof. The certificate will consist of the following: 

1. A collection of variables P — {pi, . . . ,p n }- 

2. A collection of substitutions tp = {pi i— » ou, i = 1 . . . n) where ai 6 A* 
and n < 3(2g + m). 

3. A collection of words in P* 



with pij £ P, eij 6 { — 1, 1} and each pi £ P occuring exactly twice. 

The C[s are represent the labels of the boundaries of the discs D\, . . . Di so 
checking contitions (i) and (ii) of Section \2. 21 can be done quickly, moreover 
we see that the size of C is at most 2n < 6(2<? + m). 

ip extends to a monoid homomorphism ip : P* — > A*, (iv) can also be 
verified quickly since for i = 1, . . . m — 1 we just need to check that some 
cyclic permutation of ip(Ci) is equal to Wi and some cyclic permutation of 
ip{Cm) is equal do d. Moreover, since the equality is graphical we have that 



Therefore the size of the certificate is bounded as advertised. All that is left 
is to determine the topology of the glued together discs. We describe the 
algorithm without too much detail. 

Step 1: Built a forest of discs: We make a graph F such that each ver- 
tex Vi £ V(T) corresonds the disc Di and each edge ej € E(T) corresponds 
to the variable pj £ P. The edge et goes from Vi to Vj if and only if the 
variable pt occurs in the boudary of Di and in the boundary of Dj or if 
i = j then there are two different occurences of the variable pt- We con- 
struct a spanning forest T. This enables us to count the number of connected 
components Eq, . . . , Ej. 



2.3 The certificate 




\ai\ + . . . \a n \ < \wi\ + ...+ \w m \ + \d\ 
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Step 2: Determine orientability: For each maximal tree T r C T we get 
a "tree of discs" by glueing together only the pairs of edges whose labels 
correspond to elements of E(T r ). The resulting tree of discs is a simply 
connected topological space that can be embedded in the plane and we can 
read a cyclic word c(T r ) in P* along its boundary. The surface E r obtained 
by glueing together the remaining paired edges of the tree of discs will be 
orientable only if whenever p* 1 occurs in c(T r ) then pj 1 also occurs. We 
can also check (v) at this point. 

Step 3: Compute Euler characteristic: The identification of the boundary 
of the discs with graphs, enables us to think of the discs as polygons. If a 
disc Di has Ni sides then we give each corner of Di an angle of n(Ni — 2)/Ni. 
Then for each tree of discs produced in the previous step, we identify the 
remaining pairs of edges to get the surfaces Eo, . . . Ej, which now have an 
extra angular structure. To each Ej , we can apply the Combinatorial Gauss- 
Bonnet Theorem which states that for an angled two-complex X, 

fexm „ex(o) 

where X' 2 ' is the set of faces and X^ is the set of vertices. This angle 
assingment gives each face / a curvature n(f) = and each vertex has 
curvature 

k(v) = 2k-( Z ( c )) 

i.e. k(v) is 2-7T minus the sum of the angles that meet at v. 

With an appropriate data structure one can perform steps 1-3 (not neces- 
sarily in sequential order) in at most quadratic time in the size of C. Once all 
that is done, verifying the inequalities of (Hi) is easy and we are finished. □ 

3 Quadratic equations over free groups are 
NP-hard 

We will present the bin packing problem which is known to be NP-complete 
and show that it is equivalent deciding if a certain type of quadratic equation 
has a solution. 

3.1 Bin Packing 

Problem 3.1 (Bin Packing). 

• INPUT: A k— tuple of positive integers (n, . . . , ru) and positive integers 
B,N. 

• QUESTION: Is there a partition of {1, ... , k} into N subsets 

{1, ...,fc} = Si U .. . US N 
such that for each i — 1, . . . , N we have 

£ rj< B (3) 
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This problem is NP-hard in the strong sense (see [GJ79] p. 226), i.e. there 
are NP-hard instances of this problem when both B and the Tj are bounded 
by a polynomial function of of k. 

Let t = NB — Yli=i Ti - Then by replacing (n, . . . , ru) by the k + i-tuple 
(ri, . . . , Tfc, . . . , 1, . . . , 1) we can assume that the inequalities © are actually 
equalities. This modified version is still NP hard in the strong sense. We 
state it explicitly: 

Problem 3.2 (Exact Bin Packing). 

• INPUT: A k— tuple of positive integers (n, . . . , rj.) and positive integers 
B, N. 

• QUESTION: Is there a partition of {1, . . . , k} into N subsets 

{1, ...,k} = Si U ... USn 
such that for each i = 1, . . . , N we have 

The authors warmly thank Laszlo Babai for drawing their attention to 
this problem in connection to tiling problems. 

3.2 Tiling discs 

Throughout this section we will consider the discs to be embedded in E 2 and 
will always read clockwise around closed curves. 

Definition 3.3. A [a, b n ]-disc is a disc as in section [2721 along whose bound- 
ary one can read the cyclic word [a, b n ]. 

Definition 3.4. A [a,b n ]-ribbon is a rectangular cell complex obtained by 
attaching [a, 6 J ]-discs by their a-labeled edges, such that we can read [a, b n ] 
along its boundary. The top of an [o, b n ] ribbon is the boundary subpath 
along which we can read the word b~ n , the bottom is the boundary subpath 
along which we can read the word b". 

Definition 3.5. Let D be a disc tiled by [a, 6"]-discs, we define the a—pattern 
of D to be a graph defined as follows: 

1. In the middle of each a-labeled edge put a vertex. 

2. Between any two vertices contained in the same [a, 6"]-disc draw an 
edge. 

Connected components of a— patterns are called a— tracks 

Lemma 3.6. A disc D tiled by finitely many [a, b n ]-discs cannot have any 
circular a— tracks. 

Proof. It is clear that every a— track is a graph whose vertices have valency 
at most 2. If an a— track t has vertices of valency 1 then they must lie on 
dD. 

Suppose towards a contradiction that D has a circular a-track c. Then c 
divides D into two components: an interior and an exterior. If we examine 
the interior we see that it is a planar union of discs with only the letter b 
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occuring on its boundary, it follows that the interior contains a disc D' with 
circular a-track. Repeating the argument we find that D must have infinitely 
many cells which is a contradiction. □ 

Corollary 3.7. If D is a disc tiled by finitely many [a, b n ]-discs, then the 
cyclic word read around dD cannot contain only the letter b. 

Corollary 3.8. We cannot tile a sphere with finitely many coherently ori- 
ented [a, b n ]-discs. 

Proposition 3.9. Suppose that D is a disc with boundary label [a , b B ] that 
is covered by [a, b"}- discs, then it is obtained from a collection of M [a, 6 s ]- 
ribbons Ri,...Rm such that the bottom of R i+1 is glued to the top of Ri, 
i = l,...M. 

Proof. We procede by induction on TV. If N = 1, then we consider the 
a— track t starting at one of the edges of dD labeled a. t must touch the 
other edge labeled a in dD. Let R(t) be the subset of D consisting of 
the [a, 6 n ]-discs that t intersects. We note that R(t) can be obtained by 
making some identifications in the top and bottom of some [a, 6 fl ]-ribbon, 
but R(t) C D, which means on one hand that if R(t) is not simpy connected 
then some subset of dR(t) is a circle that bounds a disc inside D, this disc 
can only have b's in its label contradicting Corollary 13.71 It follows that 
R{t) is a ribbon and it contains every a— labled edge in D, so we must have 
R(t) = D. 

Suppose the hypothesis held for all L < TV — 1 and suppose that we 
could read [a N ,b B ] along dD. We divide dD into four arcs l a ,tb,r a ,bb that 
have labels a- N ,b- B ,a N ,b B respectively, i.e. the left, top, right and bottom 
sides. Let e be the edge with label a that touches the vertex between l a and 
tb- Let t be the corresponding a— track. Let R(t) be as above, since DcE 2 
it is easy to see that t cannot be a line from l a to l a , therefore t must go 
from l a to some edge e' in r a . 

Suppose towards a contradiction that e' was not the edge in r a that 
touched the vertex v between tb and r a . Let / be the edge in r a that touches 
v, and let u be the corresponding a— track, since a-tracks cannot cross we 
have that u must also end in r a which is a contradiction. 

By the same argument as in the case TV = 1 we have that R(t) must be 
a embedded ribbon. By Corollary 13.71 we must have that tb is contained in 
the top of R(t), which means that R(t) is an embedded [a, 6 B ]-ribbon and 
if we remove R(t) from D, then what remains is a disc D' such that we can 
read [a N ~ 1 ,b B ] along the boundary. So by induction the result follows. □ 

3.3 A special genus zero quadratic equation 

Equipped with Proposition 13. 91 we shall deduce NP hardness of the following 
equation: 

k 

WzJ^a^z^^y] (5) 

i=i 

By the results in section 12.21 ((5} has a solution if and only if there is a 
collection of discs Dj with boundary labels [a, 6™ J ] for j = 1 . . . k respectively 
and a disc D m with boundary label [a N ,b B ] such that, glued together in a 
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way that respect labels and orientation of edges, form a union of spheres 
(this is forced by the first inequality in (iii) , section 12. 2} . 

Theorem 3.10. Deciding if the quadratic equation J3|) with coefficients 

[a, b ni ], . . . , [a, b" k ] and [a N ,b B ] 

has a solution is equivalent to deciding if vroblem \3.3k with input (m, . . . , n m ) 
and positive integers B, N; has a positive answer. 

Proof. "Bin packing =>• solution." Suppose that Problem 13.21 has a positive 
answer on the specified inputs. For each subset Si of the given partition 
of {1, ...,k} we form a [a, 6 s ]-ribbon Ri by glueing together the [a, b nj ]- 
discs for j £ Si, this is possible by (iv) in section T2.2I and equation @. 
We then construct one hemisphere by glueing the ribbons Ri, . . . , Rm ■ The 
other hemisphere is the remaining disc with boundary label [a N ,6 s ] -1 , the 
resulting sphere proves the solvability of (JSJ with the given coefficients. 

"Solution =>■ bin packing." If ((5JI has a solution then there is a union 
of spheres tiled with [a, 6 n ']-discs and one [a , fc s ] _1 -disc, moreover these 
discs are coherently oriented. By condition (v) and Corollary 13.81 there can 
only be one sphere: the sphere So containing the unique [a N , 6 s ]~ 1 -disc. 
If we remove this [a N , & s ] _1 -disc from So what remains will be a disc D 
with boundary label [a N ,b B ] tiled with [a, fe™»]-discs. Applying Proposition 
13.91 divides D into ribbons Ri, . . . Rn and we immmediately see that these 
ribbons provide a partition of {ni, . . . n^}, showing that Problem 13.21 has a 
positive solution on the given input. □ 
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